Amendments to the Claims



1. (Currently Amended) A method of generating an index for a sequence that 
supports a non-contiguous subsequence match, comprising: 

receiving a sequence comprising one or more pairs of symbol and weight
elements, the symbol elements corresponding to a non-uniform frequency
distribution;

receiving a window size; 

encoding the sequence into a weighted-sequence; 

encoding the weighted sequence into one or more one-dimensional sequences,
wherein the length of each of the one or more one-dimensional sequences is less than 
the window size; 

reordering the one or more one-dimensional sequences using the non-uniform
frequency distribution to generate a new sequence;

inserting the new sequence as one or more trie nodes into a trie structure; and

generating an index, wherein generating the index comprises:

generating a current sequential ID and a maximum sequential ID pair
for each of the one or more trie nodes, wherein the current sequential ID of
any descendant of a given trie node of the one or more trie nodes is between the
current sequential ID of the given trie node and the maximum sequential ID;

generating an iso-depth link for each unique symbol in each of the one
or more one-dimensional sequences, wherein the iso-depth link comprises trie
nodes under the symbol; and

generating an offset list comprising an original position for each of
one or more subsequences in the weighted-sequence.
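
The index-generation step of claim 1 can be illustrated with a minimal Python sketch. It is not the applicant's implementation. The names TrieNode, insert, label, and is_descendant are hypothetical, the IDs are assumed to be assigned in depth-first preorder, and the iso-depth links are assumed to be keyed by (symbol, depth); none of these choices is stated in the claim.

```python
from collections import defaultdict


class TrieNode:
    def __init__(self, symbol=None):
        self.symbol = symbol
        self.children = {}   # symbol -> TrieNode
        self.offsets = []    # original positions of subsequences ending at this node
        self.cur_id = None   # current sequential ID
        self.max_id = None   # largest sequential ID found in this node's subtree


def insert(root, sequence, offset):
    """Insert one sequence into the trie and record its original position."""
    node = root
    for symbol in sequence:
        node = node.children.setdefault(symbol, TrieNode(symbol))
    node.offsets.append(offset)


def label(root):
    """Assign (cur_id, max_id) pairs in preorder and collect iso-depth links."""
    iso_depth = defaultdict(list)  # assumed key: (symbol, depth) -> nodes in ID order
    counter = 0

    def visit(node, depth):
        nonlocal counter
        node.cur_id = counter
        counter += 1
        if node.symbol is not None:
            iso_depth[(node.symbol, depth)].append(node)
        for symbol in sorted(node.children):
            visit(node.children[symbol], depth + 1)
        node.max_id = counter - 1  # every ID handed out below this node is <= max_id

    visit(root, 0)
    return iso_depth


def is_descendant(ancestor, node):
    """A descendant's current ID lies between the ancestor's current and maximum IDs."""
    return ancestor.cur_id < node.cur_id <= ancestor.max_id
```

With this labeling, an ancestor test reduces to two integer comparisons, so a lookup can move from one symbol to the next through the iso-depth links without walking the trie itself.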

2. (Previously Presented) The method of claim 1, wherein encoding the
sequence into the weighted-sequence comprises encoding the sequence with weights 
represented by real numbers. 

3. (Previously Presented) The method of claim 2, wherein encoding the 
sequence with weights represented by real numbers comprises discretizing the
sequence into a number of equi-width units.
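
As an illustration of the discretizing step in claim 3, the following Python sketch maps real-valued weights to equi-width units. The function name discretize, the num_units parameter, and the choice to span the units between the smallest and largest observed weight are assumptions made for the example and are not taken from the claims.

```python
def discretize(weights, num_units):
    """Map each real-valued weight to the index of its equi-width unit."""
    lo, hi = min(weights), max(weights)
    if hi == lo:
        # All weights are identical, so they all fall in the first unit.
        return [0] * len(weights)
    width = (hi - lo) / num_units
    # The largest weight sits on the upper boundary; clamp it into the last unit.
    return [min(int((w - lo) / width), num_units - 1) for w in weights]
```

For example, discretize([0.0, 2.5, 5.0, 7.5, 10.0], 4) uses units of width 2.5 and yields [0, 1, 2, 3, 3].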

4. (Currently Amended) The method of claim 1, wherein inserting the new
sequence into the trie structure is
performed using a depth-first traversal.

5. (Previously Presented) The method of claim 1, further comprising creating
a weighted-sequences index, wherein the weighted-sequences index comprises an
iso-depth index, wherein the iso-depth index is a one-dimensional buffer.

6. (Previously Presented) The method of claim 1, further comprising creating
a weighted-sequences index, wherein the weighted-sequences index comprises an
iso-depth index, wherein the iso-depth index is a B+ tree.

7. (Previously Presented) The method of claim 1, further comprising creating
a weighted-sequences index, wherein the weighted-sequences index comprises an
iso-depth index, wherein the iso-depth index is a linked list.

8. (Cancelled)

9. (Cancelled) 

10. (Cancelled) 

11. (Currently Amended) The method of claim 1, wherein reordering
the one or more one-dimensional sequences using the non-uniform frequency
distribution to generate a new sequence comprises:

(a) adding an offset 2*w*r to each of the weight elements to generate a new
weight, wherein w is a window size, and r is a frequency rank for a symbol of each of
the symbol elements;

(b) sorting the pairs of symbol and weight elements by the new weight;

(c) placing a moving window of size 2*w*A on the new sequence, wherein A
is a total number of the symbols; and

(d) indexing the new sequence in a new window.
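
Steps (a) through (c) of claim 11 can be sketched in Python as follows. This is one possible reading, not the applicant's implementation. The helper names frequency_ranks, reorder, and windows are hypothetical; giving rank 0 to the most frequent symbol and measuring the moving window along the new-weight axis are both assumptions. Step (d), indexing each window, is left to the caller.

```python
from collections import Counter


def frequency_ranks(pairs):
    """Rank symbols by frequency; rank 0 is the most frequent (assumed convention)."""
    counts = Counter(symbol for symbol, _ in pairs)
    ordered = sorted(counts, key=lambda s: (-counts[s], s))
    return {symbol: rank for rank, symbol in enumerate(ordered)}


def reorder(pairs, w, ranks):
    """Steps (a) and (b): add the offset 2*w*r to each weight, then sort by new weight."""
    shifted = [(symbol, weight + 2 * w * ranks[symbol]) for symbol, weight in pairs]
    return sorted(shifted, key=lambda pair: pair[1])


def windows(new_sequence, w, num_symbols):
    """Step (c): yield the pairs covered by a window of size 2*w*A at each start."""
    size = 2 * w * num_symbols
    for i, (_, start) in enumerate(new_sequence):
        yield [pair for pair in new_sequence[i:] if pair[1] - start < size]
```

For the pairs [("a", 1), ("b", 2), ("a", 3), ("c", 4)] with w = 5, the ranks are a: 0, b: 1, c: 2, and the reordered sequence is [("a", 1), ("a", 3), ("b", 12), ("c", 24)], so pairs carrying the same symbol end up grouped together.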

12. (Previously Presented) The method of claim 1, wherein receiving the 
sequence comprises receiving one or more scientific datasets, transforming each of 
the one or more scientific datasets into one or more sequences, and concatenating the 
one or more sequences to form a long sequence. 

13. (Cancelled) 

14. (Cancelled) 

15. (Currently Amended) A program storage device readable by machine, 
tangibly embodying a program of instructions executable by the machine to perform 
method steps of generating an index for a sequence that supports a non-contiguous
subsequence match, the method steps comprising:

receiving a sequence comprising one or more pairs of symbol and weight
elements, the symbol elements corresponding to a non-uniform frequency
distribution;

receiving a window size; 

encoding the sequence into a weighted-sequence; 

encoding the weighted sequence into one or more one-dimensional sequences,
wherein the length of each of the one or more one-dimensional sequences is less than
the window size;

reordering the one or more one-dimensional sequences using the non-uniform
frequency distribution to generate a new sequence;

inserting the new sequence as one or more trie nodes into a trie structure; and

generating an index, wherein generating the index comprises:

generating a current sequential ID and a maximum sequential ID pair 
for each of the one or more trie nodes, wherein the current sequential ID of 
any descendant of a given trie node of the one or more trie nodes is between the 
current sequential ID of the given trie node and the maximum sequential ID;

generating an iso-depth link for each unique symbol in each of the one
or more one-dimensional sequences, wherein the iso-depth link comprises trie
nodes under the symbol; and

generating an offset list comprising an original position for each of
one or more subsequences in the weighted-sequence.



16. (Cancelled) 



17. (New) The program storage device of claim 15, wherein the instructions
for encoding the [illegible]

18. (New) The program storage device of claim 17, wherein the instructions for
encoding the sequence with weights represented by real numbers comprise [illegible]

19. (New) The program storage device [illegible]

20. (New) The program storage device of claim 15, further comprising
instructions for creating a weighted-sequences index, wherein the
weighted-sequences index [illegible]

21. (New) The program storage device of [illegible]
instructions for [illegible] tree.

22. (New) The program storage device [illegible]
instructions for creating a weighted-sequences index, wherein the
weighted-sequences index comprises an iso-depth index, wherein the iso-depth index is a linked